Overview
Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 1125 |
| Missing cells | 149 |
| Missing cells (%) | 1.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 114.4 KiB |
| Average record size in memory | 104.1 B |
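The percentages and per-record size in this table can be recomputed from the raw counts. A minimal sketch, assuming missing cells are divided by variables × observations and that KiB means 1024 bytes (the variable names are illustrative, not taken from the report):

```python
# Counts copied from the "Dataset statistics" table.
n_variables = 13
n_observations = 1125
missing_cells = 149
total_memory_kib = 114.4

total_cells = n_variables * n_observations                    # 14625 cells
missing_pct = 100 * missing_cells / total_cells               # about 1.02
bytes_per_record = total_memory_kib * 1024 / n_observations   # about 104.1

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {bytes_per_record:.1f} B")
```

Both results round to the figures shown in the table (1.0% and 104.1 B).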
Variable types
| Categorical | 4 |
|---|---|
| Numeric | 9 |
Alerts
| fea_1 is highly overall correlated with fea_6 | High correlation |
|---|---|
| fea_6 is highly overall correlated with fea_1 and 1 other fields | High correlation |
| id is highly overall correlated with fea_6 | High correlation |
| fea_5 is highly imbalanced (63.0%) | Imbalance |
| fea_2 has 149 (13.2%) missing values | Missing |
| id has unique values | Unique |
Reproduction
| Analysis started | 2025-03-12 01:56:30.318391 |
|---|---|
| Analysis finished | 2025-03-12 01:56:35.363058 |
| Duration | 5.04 seconds |
| Software version | ydata-profiling v4.13.0 |
| Download configuration | config.json |
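The reported duration follows directly from the two timestamps. A small sketch using only the standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" table.
started = datetime.fromisoformat("2025-03-12 01:56:30.318391")
finished = datetime.fromisoformat("2025-03-12 01:56:35.363058")

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```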
Variables
label
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 900 | 80.0% |
| 1 | 225 | 20.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 900 | 80.0% |
| 1 | 225 | 20.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
id
Real number (ℝ)
High correlation  Unique 
| Distinct | 1125 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57836771 |
| Minimum | 54982353 |
|---|---|
| Maximum | 59006239 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 KiB |
Quantile statistics
| Minimum | 54982353 |
|---|---|
| 5-th percentile | 54984218 |
| Q1 | 54990497 |
| median | 58989748 |
| Q3 | 58997994 |
| 95-th percentile | 59004536 |
| Maximum | 59006239 |
| Range | 4023886 |
| Interquartile range (IQR) | 4007497 |
Descriptive statistics
| Standard deviation | 1817150.4 |
|---|---|
| Coefficient of variation (CV) | 0.0314186 |
| Kurtosis | -1.1319137 |
| Mean | 57836771 |
| Median Absolute Deviation (MAD) | 9940 |
| Skewness | -0.93276429 |
| Sum | 6.5066368 × 10^10 |
| Variance | 3.3020355 × 10^12 |
| Monotonicity | Not monotonic |
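Several of the derived statistics for id can be cross-checked against one another. The sketch below recomputes them from the rounded values printed in the tables, so small differences in the last digits are expected; the variable names are illustrative.

```python
# Values copied from the id statistics tables (already rounded by the report).
n = 1125
mean = 57836771
std = 1817150.4
minimum, maximum = 54982353, 59006239
q1, q3 = 54990497, 58997994

cv = std / mean                   # coefficient of variation, about 0.0314186
variance = std ** 2               # about 3.302e12
total = mean * n                  # about 6.5066e10
value_range = maximum - minimum   # 4023886
iqr = q3 - q1                     # 4007497

print(f"CV={cv:.7f}  variance={variance:.7e}  sum={total:.7e}")
print(f"range={value_range}  IQR={iqr}")
```

The large gap between the IQR (4007497) and the MAD (9940) reflects that the ids fall into two tight clusters near 54.98 million and 59.00 million.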
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 54982665 | 1 | 0.1% |
| 58991343 | 1 | 0.1% |
| 54988970 | 1 | 0.1% |
| 54991614 | 1 | 0.1% |
| 58989779 | 1 | 0.1% |
| 59003701 | 1 | 0.1% |
| 59005448 | 1 | 0.1% |
| 58999966 | 1 | 0.1% |
| 58986443 | 1 | 0.1% |
| 58984421 | 1 | 0.1% |
| Other values (1115) | 1115 | 99.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 54982353 | 1 | 0.1% |
| 54982356 | 1 | 0.1% |
| 54982387 | 1 | 0.1% |
| 54982463 | 1 | 0.1% |
| 54982530 | 1 | 0.1% |
| 54982549 | 1 | 0.1% |
| 54982579 | 1 | 0.1% |
| 54982665 | 1 | 0.1% |
| 54982697 | 1 | 0.1% |
| 54982721 | 1 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 59006239 | 1 | 0.1% |
| 59006219 | 1 | 0.1% |
| 59006193 | 1 | 0.1% |
| 59006139 | 1 | 0.1% |
| 59005995 | 1 | 0.1% |
| 59005917 | 1 | 0.1% |
| 59005881 | 1 | 0.1% |
| 59005880 | 1 | 0.1% |
| 59005871 | 1 | 0.1% |
| 59005860 | 1 | 0.1% |
fea_1
Real number (ℝ)
High correlation 
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.4826667 |
| Minimum | 1 |
|---|---|
| Maximum | 7 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 4 |
| median | 5 |
| Q3 | 7 |
| 95-th percentile | 7 |
| Maximum | 7 |
| Range | 6 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.3833375 |
|---|---|
| Coefficient of variation (CV) | 0.25231108 |
| Kurtosis | -1.1810284 |
| Mean | 5.4826667 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.10437923 |
| Sum | 6168 |
| Variance | 1.9136228 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 7 | 476 | 42.3% |
| 4 | 377 | 33.5% |
| 5 | 261 | 23.2% |
| 1 | 7 | 0.6% |
| 6 | 2 | 0.2% |
| 2 | 2 | 0.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 7 | 0.6% |
| 2 | 2 | 0.2% |
| 4 | 377 | 33.5% |
| 5 | 261 | 23.2% |
| 6 | 2 | 0.2% |
| 7 | 476 | 42.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 7 | 476 | 42.3% |
| 6 | 2 | 0.2% |
| 5 | 261 | 23.2% |
| 4 | 377 | 33.5% |
| 2 | 2 | 0.2% |
| 1 | 7 | 0.6% |
fea_2
Real number (ℝ)
Missing 
| Distinct | 158 |
|---|---|
| Distinct (%) | 16.2% |
| Missing | 149 |
| Missing (%) | 13.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1283.9114 |
| Minimum | 1116.5 |
|---|---|
| Maximum | 1481 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 KiB |
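The two percentages for fea_2 appear to use different denominators: the missing share matches a division by all rows, while the distinct share matches a division by the non-missing rows only. The sketch below checks that reading against the reported figures; it is an inference from the numbers, not something the report states, and the variable names are illustrative.

```python
# Counts copied from the fea_2 overview and the dataset statistics.
n_rows = 1125
n_missing = 149
n_distinct = 158
mean = 1283.9114   # from the fea_2 statistics

n_present = n_rows - n_missing                 # 976 non-missing values
missing_pct = 100 * n_missing / n_rows         # about 13.2
distinct_pct = 100 * n_distinct / n_present    # about 16.2
approx_sum = mean * n_present                  # about 1253097.5

print(f"missing={missing_pct:.1f}%  distinct={distinct_pct:.1f}%  sum={approx_sum:.1f}")
```

The reported Sum (1253097.5) is likewise consistent with the mean taken over the 976 non-missing values.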
Quantile statistics
| Minimum | 1116.5 |
|---|---|
| 5-th percentile | 1214 |
| Q1 | 1244 |
| median | 1281.5 |
| Q3 | 1314.5 |
| 95-th percentile | 1371.5 |
| Maximum | 1481 |
| Range | 364.5 |
| Interquartile range (IQR) | 70.5 |
Descriptive statistics
| Standard deviation | 51.764022 |
|---|---|
| Coefficient of variation (CV) | 0.040317441 |
| Kurtosis | 0.58544031 |
| Mean | 1283.9114 |
| Median Absolute Deviation (MAD) | 36 |
| Skewness | 0.41207622 |
| Sum | 1253097.5 |
| Variance | 2679.5139 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1241 | 37 | 3.3% |
| 1214 | 27 | 2.4% |
| 1305.5 | 23 | 2.0% |
| 1287.5 | 21 | 1.9% |
| 1223 | 21 | 1.9% |
| 1304 | 20 | 1.8% |
| 1257.5 | 19 | 1.7% |
| 1266.5 | 19 | 1.7% |
| 1272.5 | 18 | 1.6% |
| 1239.5 | 17 | 1.5% |
| Other values (148) | 754 | 67.0% |
| (Missing) | 149 | 13.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1116.5 | 1 | 0.1% |
| 1125.5 | 1 | 0.1% |
| 1130 | 1 | 0.1% |
| 1137.5 | 1 | 0.1% |
| 1148 | 1 | 0.1% |
| 1163 | 1 | 0.1% |
| 1164.5 | 1 | 0.1% |
| 1166 | 1 | 0.1% |
| 1170.5 | 2 | 0.2% |
| 1179.5 | 2 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1481 | 1 | 0.1% |
| 1475 | 1 | 0.1% |
| 1469 | 2 | 0.2% |
| 1455.5 | 1 | 0.1% |
| 1449.5 | 1 | 0.1% |
| 1443.5 | 2 | 0.2% |
| 1425.5 | 1 | 0.1% |
| 1419.5 | 2 | 0.2% |
| 1415 | 3 | 0.3% |
| 1413.5 | 1 | 0.1% |
fea_3
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 684 | 60.8% |
| 1 | 309 | 27.5% |
| 2 | 132 | 11.7% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 684 | 60.8% |
| 1 | 309 | 27.5% |
| 2 | 132 | 11.7% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
fea_4
Real number (ℝ)
| Distinct | 229 |
|---|---|
| Distinct (%) | 20.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 120883.56 |
| Minimum | 15000 |
|---|---|
| Maximum | 1200000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 KiB |
Quantile statistics
| Minimum | 15000 |
|---|---|
| 5-th percentile | 39000 |
| Q1 | 72000 |
| median | 102000 |
| Q3 | 139000 |
| 95-th percentile | 282000 |
| Maximum | 1200000 |
| Range | 1185000 |
| Interquartile range (IQR) | 67000 |
Descriptive statistics
| Standard deviation | 88445.229 |
|---|---|
| Coefficient of variation (CV) | 0.73165641 |
| Kurtosis | 32.210409 |
| Mean | 120883.56 |
| Median Absolute Deviation (MAD) | 32000 |
| Skewness | 4.174755 |
| Sum | 1.35994 × 10^8 |
| Variance | 7.8225585 × 10^9 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 35000 | 34 | 3.0% |
| 50000 | 25 | 2.2% |
| 90000 | 19 | 1.7% |
| 150000 | 19 | 1.7% |
| 110000 | 18 | 1.6% |
| 100000 | 18 | 1.6% |
| 130000 | 17 | 1.5% |
| 68000 | 16 | 1.4% |
| 56000 | 15 | 1.3% |
| 71000 | 15 | 1.3% |
| Other values (219) | 929 | 82.6% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 15000 | 2 | 0.2% |
| 30000 | 14 | 1.2% |
| 34000 | 1 | 0.1% |
| 35000 | 34 | 3.0% |
| 38000 | 3 | 0.3% |
| 39000 | 4 | 0.4% |
| 41000 | 1 | 0.1% |
| 42000 | 2 | 0.2% |
| 43000 | 1 | 0.1% |
| 44000 | 2 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1200000 | 1 | 0.1% |
| 1000000 | 1 | 0.1% |
| 550000 | 1 | 0.1% |
| 546000 | 1 | 0.1% |
| 500000 | 8 | 0.7% |
| 489000 | 1 | 0.1% |
| 488000 | 1 | 0.1% |
| 483000 | 1 | 0.1% |
| 468000 | 1 | 0.1% |
| 458000 | 1 | 0.1% |
fea_5
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 8.9 KiB |
| 2 | 1045 |
|---|---|
| 1 | 80 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1045 | 92.9% |
| 1 | 80 | 7.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 1045 | 92.9% |
| 1 | 80 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
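The 63.0% imbalance figure reported for fea_5 is consistent with one minus the normalized Shannon entropy of the class counts. The sketch below assumes that definition, which the report itself does not state; the function name `imbalance_score` is hypothetical.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class frequencies and k is the number of classes.
    0 means perfectly balanced; values near 1 mean one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no balance to measure (a choice made for this sketch)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# Class counts for fea_5: value 2 occurs 1045 times, value 1 occurs 80 times.
score = imbalance_score([1045, 80])
print(f"{100 * score:.1f}%")
```

Under this assumed definition the 1045/80 split gives a score of about 0.63, matching the alert.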
fea_6
Real number (ℝ)
High correlation 
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.872 |
| Minimum | 3 |
|---|---|
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 8 |
| median | 11 |
| Q3 | 11 |
| 95-th percentile | 15 |
| Maximum | 16 |
| Range | 13 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.6764373 |
|---|---|
| Coefficient of variation (CV) | 0.24617709 |
| Kurtosis | -0.85633027 |
| Mean | 10.872 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 0.30198963 |
| Sum | 12231 |
| Variance | 7.1633167 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 11 | 465 | 41.3% |
| 8 | 375 | 33.3% |
| 15 | 259 | 23.0% |
| 12 | 11 | 1.0% |
| 4 | 4 | 0.4% |
| 5 | 3 | 0.3% |
| 6 | 2 | 0.2% |
| 9 | 2 | 0.2% |
| 3 | 2 | 0.2% |
| 16 | 2 | 0.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 3 | 2 | 0.2% |
| 4 | 4 | 0.4% |
| 5 | 3 | 0.3% |
| 6 | 2 | 0.2% |
| 8 | 375 | 33.3% |
| 9 | 2 | 0.2% |
| 11 | 465 | 41.3% |
| 12 | 11 | 1.0% |
| 15 | 259 | 23.0% |
| 16 | 2 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 16 | 2 | 0.2% |
| 15 | 259 | 23.0% |
| 12 | 11 | 1.0% |
| 11 | 465 | 41.3% |
| 9 | 2 | 0.2% |
| 8 | 375 | 33.3% |
| 6 | 2 | 0.2% |
| 5 | 3 | 0.3% |
| 4 | 4 | 0.4% |
| 3 | 2 | 0.2% |
fea_7
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.8328889 |
| Minimum | -1 |
|---|---|
| Maximum | 10 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 170 |
| Negative (%) | 15.1% |
| Memory size | 8.9 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | 5 |
| median | 5 |
| Q3 | 5 |
| 95-th percentile | 9 |
| Maximum | 10 |
| Range | 11 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.9711817 |
|---|---|
| Coefficient of variation (CV) | 0.61478379 |
| Kurtosis | 0.053229923 |
| Mean | 4.8328889 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.60681621 |
| Sum | 5437 |
| Variance | 8.8279209 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 689 | 61.2% |
| 9 | 212 | 18.8% |
| -1 | 170 | 15.1% |
| 2 | 17 | 1.5% |
| 8 | 9 | 0.8% |
| 3 | 9 | 0.8% |
| 4 | 7 | 0.6% |
| 7 | 6 | 0.5% |
| 10 | 5 | 0.4% |
| 1 | 1 | 0.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -1 | 170 | 15.1% |
| 1 | 1 | 0.1% |
| 2 | 17 | 1.5% |
| 3 | 9 | 0.8% |
| 4 | 7 | 0.6% |
| 5 | 689 | 61.2% |
| 7 | 6 | 0.5% |
| 8 | 9 | 0.8% |
| 9 | 212 | 18.8% |
| 10 | 5 | 0.4% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 10 | 5 | 0.4% |
| 9 | 212 | 18.8% |
| 8 | 9 | 0.8% |
| 7 | 6 | 0.5% |
| 5 | 689 | 61.2% |
| 4 | 7 | 0.6% |
| 3 | 9 | 0.8% |
| 2 | 17 | 1.5% |
| 1 | 1 | 0.1% |
| -1 | 170 | 15.1% |
fea_8
Real number (ℝ)
| Distinct | 52 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.80267 |
| Minimum | 64 |
|---|---|
| Maximum | 115 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 KiB |
Quantile statistics
| Minimum | 64 |
|---|---|
| 5-th percentile | 80 |
| Q1 | 90 |
| median | 105 |
| Q3 | 111 |
| 95-th percentile | 114 |
| Maximum | 115 |
| Range | 51 |
| Interquartile range (IQR) | 21 |
Descriptive statistics
| Standard deviation | 11.988955 |
|---|---|
| Coefficient of variation (CV) | 0.1189349 |
| Kurtosis | -0.28496104 |
| Mean | 100.80267 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | -0.83900783 |
| Sum | 113403 |
| Variance | 143.73505 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 110 | 98 | 8.7% |
| 112 | 93 | 8.3% |
| 100 | 80 | 7.1% |
| 113 | 70 | 6.2% |
| 114 | 65 | 5.8% |
| 105 | 58 | 5.2% |
| 111 | 52 | 4.6% |
| 109 | 42 | 3.7% |
| 90 | 42 | 3.7% |
| 107 | 39 | 3.5% |
| Other values (42) | 486 | 43.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 64 | 7 | 0.6% |
| 65 | 1 | 0.1% |
| 66 | 1 | 0.1% |
| 67 | 2 | 0.2% |
| 68 | 2 | 0.2% |
| 69 | 1 | 0.1% |
| 70 | 2 | 0.2% |
| 71 | 2 | 0.2% |
| 72 | 2 | 0.2% |
| 73 | 3 | 0.3% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 115 | 15 | 1.3% |
| 114 | 65 | 5.8% |
| 113 | 70 | 6.2% |
| 112 | 93 | 8.3% |
| 111 | 52 | 4.6% |
| 110 | 98 | 8.7% |
| 109 | 42 | 3.7% |
| 108 | 28 | 2.5% |
| 107 | 39 | 3.5% |
| 106 | 18 | 1.6% |
fea_9
Categorical
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 5 |
|---|---|
| 2nd row | 3 |
| 3rd row | 5 |
| 4th row | 3 |
| 5th row | 4 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 521 | 46.3% |
| 4 | 318 | 28.3% |
| 3 | 278 | 24.7% |
| 1 | 7 | 0.6% |
| 2 | 1 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 5 | 521 | 46.3% |
| 4 | 318 | 28.3% |
| 3 | 278 | 24.7% |
| 1 | 7 | 0.6% |
| 2 | 1 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| (unknown) | 1125 | 100.0% |
fea_10
Real number (ℝ)
| Distinct | 280 |
|---|---|
| Distinct (%) | 24.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 164618.5 |
| Minimum | 60000 |
|---|---|
| Maximum | 650070 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 KiB |
Quantile statistics
| Minimum | 60000 |
|---|---|
| 5-th percentile | 60005 |
| Q1 | 60044 |
| median | 72000 |
| Q3 | 151307 |
| 95-th percentile | 450081 |
| Maximum | 650070 |
| Range | 590070 |
| Interquartile range (IQR) | 91263 |
Descriptive statistics
| Standard deviation | 152520.49 |
|---|---|
| Coefficient of variation (CV) | 0.92650882 |
| Kurtosis | 0.14193148 |
| Mean | 164618.5 |
| Median Absolute Deviation (MAD) | 11984 |
| Skewness | 1.2698434 |
| Sum | 1.8519581 × 10^8 |
| Variance | 2.3262499 × 10^10 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 151300 | 128 | 11.4% |
| 72000 | 103 | 9.2% |
| 72001 | 40 | 3.6% |
| 60000 | 26 | 2.3% |
| 60019 | 23 | 2.0% |
| 60091 | 23 | 2.0% |
| 71000 | 21 | 1.9% |
| 60018 | 17 | 1.5% |
| 60014 | 17 | 1.5% |
| 60036 | 17 | 1.5% |
| Other values (270) | 710 | 63.1% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 60000 | 26 | 2.3% |
| 60001 | 10 | 0.9% |
| 60002 | 5 | 0.4% |
| 60004 | 10 | 0.9% |
| 60005 | 11 | 1.0% |
| 60006 | 3 | 0.3% |
| 60007 | 5 | 0.4% |
| 60008 | 3 | 0.3% |
| 60010 | 1 | 0.1% |
| 60011 | 2 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 650070 | 1 | 0.1% |
| 650018 | 1 | 0.1% |
| 650005 | 1 | 0.1% |
| 591044 | 1 | 0.1% |
| 591017 | 1 | 0.1% |
| 591003 | 1 | 0.1% |
| 591001 | 1 | 0.1% |
| 552104 | 2 | 0.2% |
| 551201 | 1 | 0.1% |
| 550115 | 1 | 0.1% |
fea_11
Real number (ℝ)
| Distinct | 266 |
|---|---|
| Distinct (%) | 23.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 134.999 |
| Minimum | 1 |
|---|---|
| Maximum | 707.10678 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 8.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 173.20508 |
| Q3 | 202.48457 |
| 95-th percentile | 291.47586 |
| Maximum | 707.10678 |
| Range | 706.10678 |
| Interquartile range (IQR) | 201.48457 |
Descriptive statistics
| Standard deviation | 112.6168 |
|---|---|
| Coefficient of variation (CV) | 0.83420465 |
| Kurtosis | 0.7591074 |
| Mean | 134.999 |
| Median Absolute Deviation (MAD) | 50.401717 |
| Skewness | 0.36524058 |
| Sum | 151873.88 |
| Variance | 12682.543 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 407 | 36.2% |
| 200 | 101 | 9.0% |
| 173.2050808 | 81 | 7.2% |
| 223.6067977 | 50 | 4.4% |
| 187.0828693 | 46 | 4.1% |
| 158.113883 | 33 | 2.9% |
| 212.1320344 | 30 | 2.7% |
| 316.227766 | 18 | 1.6% |
| 244.9489743 | 14 | 1.2% |
| 204.9390153 | 11 | 1.0% |
| Other values (256) | 334 | 29.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 407 | 36.2% |
| 3.16227766 | 1 | 0.1% |
| 105.9008971 | 1 | 0.1% |
| 118.6844556 | 1 | 0.1% |
| 122.4744871 | 1 | 0.1% |
| 134.9555482 | 1 | 0.1% |
| 141.4213562 | 6 | 0.5% |
| 145.6021978 | 1 | 0.1% |
| 153.2677396 | 1 | 0.1% |
| 153.3916556 | 1 | 0.1% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 707.1067812 | 1 | 0.1% |
| 692.820323 | 1 | 0.1% |
| 632.455532 | 1 | 0.1% |
| 626.8971207 | 1 | 0.1% |
| 547.7225575 | 1 | 0.1% |
| 538.5387637 | 1 | 0.1% |
| 500 | 1 | 0.1% |
| 492.6601263 | 1 | 0.1% |
| 445.0382006 | 1 | 0.1% |
| 444.4738462 | 1 | 0.1% |
Interactions
Correlations
| | fea_1 | fea_10 | fea_11 | fea_2 | fea_3 | fea_4 | fea_5 | fea_6 | fea_7 | fea_8 | fea_9 | id | label |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| fea_1 | 1.000 | 0.102 | 0.087 | -0.007 | 0.151 | -0.010 | 0.038 | 0.550 | -0.045 | 0.034 | 0.063 | -0.352 | 0.020 |
| fea_10 | 0.102 | 1.000 | 0.207 | -0.034 | 0.148 | -0.040 | 0.096 | 0.162 | -0.148 | 0.163 | 0.135 | -0.085 | 0.000 |
| fea_11 | 0.087 | 0.207 | 1.000 | 0.072 | 0.160 | 0.084 | 0.158 | 0.135 | 0.033 | 0.105 | 0.029 | -0.073 | 0.000 |
| fea_2 | -0.007 | -0.034 | 0.072 | 1.000 | 0.311 | 0.473 | 0.000 | -0.008 | -0.000 | -0.006 | 0.055 | 0.005 | 0.072 |
| fea_3 | 0.151 | 0.148 | 0.160 | 0.311 | 1.000 | 0.141 | 0.000 | 0.146 | 0.217 | 0.043 | 0.133 | 0.000 | 0.071 |
| fea_4 | -0.010 | -0.040 | 0.084 | 0.473 | 0.141 | 1.000 | 0.000 | -0.087 | 0.023 | -0.076 | 0.085 | -0.010 | 0.086 |
| fea_5 | 0.038 | 0.096 | 0.158 | 0.000 | 0.000 | 0.000 | 1.000 | 0.027 | 0.070 | 0.186 | 0.000 | 0.031 | 0.000 |
| fea_6 | 0.550 | 0.162 | 0.135 | -0.008 | 0.146 | -0.087 | 0.027 | 1.000 | -0.040 | 0.041 | 0.143 | -0.571 | 0.000 |
| fea_7 | -0.045 | -0.148 | 0.033 | -0.000 | 0.217 | 0.023 | 0.070 | -0.040 | 1.000 | 0.093 | 0.059 | 0.006 | 0.000 |
| fea_8 | 0.034 | 0.163 | 0.105 | -0.006 | 0.043 | -0.076 | 0.186 | 0.041 | 0.093 | 1.000 | 0.275 | 0.013 | 0.000 |
| fea_9 | 0.063 | 0.135 | 0.029 | 0.055 | 0.133 | 0.085 | 0.000 | 0.143 | 0.059 | 0.275 | 1.000 | 0.000 | 0.000 |
| id | -0.352 | -0.085 | -0.073 | 0.005 | 0.000 | -0.010 | 0.031 | -0.571 | 0.006 | 0.013 | 0.000 | 1.000 | 0.000 |
| label | 0.020 | 0.000 | 0.000 | 0.072 | 0.071 | 0.086 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 |
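The three "High correlation" alerts correspond to the only off-diagonal coefficients in this table with an absolute value of at least 0.5. The sketch below filters a handful of pairs copied from the table; the 0.5 cut-off is an assumption inferred from the alerts rather than a value stated in the report, and the dictionary holds only a subset of the matrix.

```python
# Selected coefficients copied from the correlation table (not the full matrix).
correlations = {
    ("fea_1", "fea_6"): 0.550,
    ("fea_1", "id"): -0.352,
    ("fea_6", "id"): -0.571,
    ("fea_2", "fea_4"): 0.473,
    ("fea_2", "fea_3"): 0.311,
    ("fea_8", "fea_9"): 0.275,
}

THRESHOLD = 0.5  # assumed cut-off; not stated in the report

# Keep pairs whose absolute correlation reaches the threshold.
high = sorted(pair for pair, r in correlations.items() if abs(r) >= THRESHOLD)
for a, b in high:
    print(f"{a} ~ {b}: {correlations[(a, b)]:+.3f}")
```

Only fea_1/fea_6 and fea_6/id pass the filter, which agrees with the alerts: fea_6 is flagged against two fields, while fea_1 and id are each flagged against fea_6 alone.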
Missing values
Sample
First rows
| | label | id | fea_1 | fea_2 | fea_3 | fea_4 | fea_5 | fea_6 | fea_7 | fea_8 | fea_9 | fea_10 | fea_11 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 54982665 | 5 | 1245.5 | 3 | 77000.0 | 2 | 15 | 5 | 109 | 5 | 151300 | 244.948974 |
| 1 | 0 | 59004779 | 4 | 1277.0 | 1 | 113000.0 | 2 | 8 | -1 | 100 | 3 | 341759 | 207.173840 |
| 2 | 0 | 58990862 | 7 | 1298.0 | 1 | 110000.0 | 2 | 11 | -1 | 101 | 5 | 72001 | 1.000000 |
| 3 | 1 | 58995168 | 7 | 1335.5 | 1 | 151000.0 | 2 | 11 | 5 | 110 | 3 | 60084 | 1.000000 |
| 4 | 0 | 54987320 | 7 | NaN | 2 | 59000.0 | 2 | 11 | 5 | 108 | 4 | 450081 | 197.403141 |
| 5 | 0 | 59005995 | 6 | 1217.0 | 3 | 56000.0 | 2 | 6 | -1 | 100 | 3 | 60091 | 1.000000 |
| 6 | 1 | 59001917 | 4 | 1304.0 | 3 | 35000.0 | 2 | 8 | 9 | 85 | 5 | 60069 | 1.000000 |
| 7 | 1 | 54984789 | 5 | 1256.0 | 3 | 78000.0 | 2 | 15 | -1 | 111 | 3 | 60030 | 1.000000 |
| 8 | 0 | 58984557 | 5 | 1323.5 | 3 | 218000.0 | 2 | 15 | 5 | 112 | 4 | 151300 | 282.842713 |
| 9 | 0 | 54990497 | 4 | NaN | 2 | 35000.0 | 2 | 8 | 5 | 101 | 3 | 60029 | 237.301496 |
Last rows
| | label | id | fea_1 | fea_2 | fea_3 | fea_4 | fea_5 | fea_6 | fea_7 | fea_8 | fea_9 | fea_10 | fea_11 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1115 | 0 | 58996837 | 7 | 1235.0 | 3 | 56000.0 | 2 | 11 | -1 | 114 | 4 | 151300 | 206.888859 |
| 1116 | 0 | 54989264 | 4 | 1343.0 | 3 | 110000.0 | 2 | 8 | 2 | 105 | 5 | 60043 | 1.000000 |
| 1117 | 0 | 59001031 | 4 | NaN | 2 | 58000.0 | 2 | 8 | 5 | 100 | 5 | 151300 | 196.214169 |
| 1118 | 0 | 58992063 | 7 | 1137.5 | 3 | 88000.0 | 2 | 11 | -1 | 107 | 4 | 450081 | 158.113883 |
| 1119 | 0 | 54985816 | 7 | 1320.5 | 3 | 108000.0 | 2 | 11 | 5 | 110 | 4 | 510068 | 248.997992 |
| 1120 | 0 | 58988196 | 5 | 1289.0 | 1 | 173000.0 | 2 | 15 | 5 | 112 | 3 | 350702 | 200.000000 |
| 1121 | 0 | 58987926 | 5 | NaN | 2 | 50000.0 | 2 | 15 | 5 | 108 | 4 | 450000 | 169.000000 |
| 1122 | 0 | 58995381 | 7 | 1220.0 | 3 | 76000.0 | 2 | 11 | 2 | 90 | 5 | 71002 | 1.000000 |
| 1123 | 0 | 58998054 | 4 | 1250.0 | 3 | 137000.0 | 2 | 8 | 5 | 90 | 5 | 72000 | 1.000000 |
| 1124 | 0 | 54989781 | 4 | 1415.0 | 3 | 93000.0 | 2 | 8 | 5 | 113 | 4 | 151300 | 273.861279 |